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Abstract 

This  paper  abandons  the  universal  assumption  in  models  of  observational 
learning  that  individuals  can  perfectly  observe  the  order  of  past  decisions.  Only 
a  worked  example  so  far,  we  show  that  complete  learning  obtains  under  the 
same  conditions  as  before.  We  try  to  underscore  that  the  resulting  analysis  in 
no  way  exploits  any  martingales  —  which  have  been  the  lifeblood  of  optimal 
learning  theory  until  now. 


"This  is  very  preliminary  and  incomplete,  and  only  a  note  in  its  present  form.  The  authors  solely  wish 
to  record  the  example  that  will  be  the  basis  for  a  paper  once  we  figure  out  exactly  what  is  happening. 
When  completed,  this  paper  will  supersede  "Error  Persistence,  and  Experiential  versus  Observational 
Learning,"  which  was  presented  on  the  Review  of  Economic  Studies  Tour  of  London,  Brussels,  ajid  Tel 
Aviv  in  1991.  The  reincarnation  of  this  paper  as  joint  work  partially  reflects  some  especially  fruitful 
conversations  with  Mathias  Dewatripont  and  Guy  Laroque. 
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1.  INTRODUCTION 

Consider  the  following  canonical  model:  A  sequence  of  individuals  each  must  make 
a  once-in-a-lifetime  binary  decision,  with  common  payoff  either  1  (in  the  high  state)  or 
—  1  (in  the  low  state)  with  equal  chance.  Everyone  must  decide  on  the  basis  of  his  own 
private  signal  and  the  knowledge  of  what  all  predecessors  have  done.  Banerjee  (1992) 
and  Bikhchandani,  Hirshleifer  and  W^lch  (1992)  (henceforth,  BHW)  independently  no- 
ticed that  'herding'  may  arise  in  the  above  observational  learning  model:  There  is  a  pos- 
itive chance  that  everyone  eventually  settles  upon  the  less  profitable  decision.  Smith  and 
Sorensen  (1994)  (hereafter  SS)  have  recently  shown,  among  other  things,  that  the  'herding' 
results  of  Banerjee  (1992)  and  BHW  obtain  precisely  when  the  signals  possessed  by  the 
individuals  have  bounded  informativeness.  Otherwise,  barring  other  innovations  in  the 
model,  complete  learning  obtains. 

A  crucial  and  perhaps  untenable  assumption  in  the  whole  'herding'  literature  is  the 
perfect  observability  of  history,  and  in  particular,  knowledge  of  the  exact  order  of  all 
decisions:  A  modelling  assumption  to  be  sure,  but  how  important  is  it?  Smith  (1991)  thus 
posited  that  entrants  can  only  observe  the  number  of  individuals  choosing  each  option, 
and  nothing  else.^  In  this  discrete  player  model, ■^  we  conjecture  (and  have  almost  proved) 
that  complete  learning  arises  under  the  same  assumption  of  'unbounded  signals'  as  in  SS. 
Knowing  the  exact  order  of  all  prior  decisions  is  thus  not  asymptotically  critical. 

But  this  variation  on  the  standard  observational  learning  model  in  itself  does  not  justify 
a  heavy  investment  of  effort,  and  we  have  no  intention  of  making  a  curious  commentary 
on  the  herding  literature.  Rather,  we  are  hoping  to  push  the  understanding  of  how  social 
learning  occurs  in  whole  new  theoretical  direction.  For  unlike  all  other  'herding'  papers, 
the  entire  single-person  learning  literature  (cf.  Easley  and  Kiefer  (1988)),  and  the  rational 
expectations  literature  (cf.  Bray  and  Kreps  (1987)),  what  really  drives  convergence  here  is 
no  longer  the  Martingale  Convergence  Theorem.  For  while  individual  2  will  always  be  able 
to  infer  the  action  of  his  predecessor,  if  he  then  takes  the  opposite  action  of  individual  1, 
then  individual  3  will  not  be  able  to  infer  who  did  what:  In  this  sense,  information  is  both 
gained  and  lost,  and  thus  not  refined,  over  time.  Thus,  in  this  worked  example,  where 
the  analytics  are  especially  clean,  we  try  to  show  that  there  is  no  (non-trivial)  conditional 
or  unconditional  martingale  to  found.  While  we  wish  to  solve  the  model,  rather  than 
explain  how  not  solve  it,  we  do  feel  defensive  on  this  point.  We  do  discover  and  exploit 
a  supermartingale,  but  this  only  turns  out  to  be  the  more  modest  ingredient  in  a  very 
complex  picture.  We  then  prove  that  full  learning  obtains  in  the  limit.  We  have  some 
,  simulations  of  the  dynamics  of  the  above  learning  model  that  attest  to  the  nontrivial 
nature  of  what  is  going  on  here. 

We  eventually  wish  to  use  this  result  to  develop  a  more  encompassing  economic  model 
of  learning,  in  which  information  is  both  gained  and  lost. 


^Smith  (1991)  embellished  the  analysis  by  assuming  individuals  played  'two-armed  bandits'  following 
their  observation  of  history.  But  the  resulting  'error  persistence'  (incomplete  learning)  can  just  as  easily 
be  proxied  by  the  possibility  that  individuals  sometimes  err  in  their  one-shot  decisions. 

-The  more  tractable  continuum  player  analysis  in  Smith  (1991)  conceals  exactly  why  the  learning  is 
complete,  and  whether  this  is  simply  an  artifact  of  the  continuum  players  assumption. 


2.   THE  EXAMPLE 

2.1  Private  Information  and  Prior  Beliefs 

An  infinite  sequence  of  individuals  n  —  1,2 sequentially  takes  actions  in  that  ex- 
ogenous order.  Each  individual  can  choose  from  one  of  two  actions. 

We  first  introduce  a  background  probability  space  (Q,£,i/).  This  space  underfies  all 
random  processes  in  the  model,  and  is  assumed  to  be  common  knowledge. 

There  are  two  states  of  the  world  (or  more  simply,  states),  labelled  H  ('high')  and  L 
("low').  Formally,  this  means  that  the  background  state  space  Q  is  partitioned  into  two 
events  Q^  and  Q^,  called  H  and  L.  Let  the  common  prior  belief  be  that  i^{H)  =  i/{L)  = 
1/2.  That  individuals  have  common  priors  is  a  standard  modelling  assumption,  see  e.g. 
Harsanyi  (1967-68).  Also,  flat  priors  are  truly  WLOG,  for  SS  show  that  more  general 
priors  are  formally  equivalent  to  a  renormalization  of  the  payoffs.  Individual  n  receives  a 
private  random  signal,  (t„  G  D,  about  the  state  of  the  world. 

Conditional  on  the  state,  the  signals  are  assumed  to  be  i.i.d.  across  individuals.  It  is 
common  knowledge  that  in  state  H  (resp.  state  L),  the  signal  is  distributed  according  to 
the  probability  measure  fi"  (resp.  /i^).  In  the  specific  example  that  we  consider,  /i^  and 
/i^'  are  described  by  densities  on  (0,1),  ij.^  has  the  density  g{(r)  =:  (1  —  y/a)/s/a  while 
H^  has  the  density  1.  Following  his  observation  of  a,  the  individual  applies  Bayes'  rule  to 
deduce  what  we  shall  refer  to  as  his  private  belief  p{a)  —  g{cr)/{g{a)  + 1)  =  1  —  ^/a  G  (0, 1) 
that  the  state  is  H.  Conditional  on  the  state,  private  beliefs  are  i.i.d.  across  individuals 
because  signals  are.  In  state  H  (resp.  state  L),  p  is  distributed  with  a  c.d.f.  F"  (resp. 
c.d.f.  F^)  on  (0,1). 

This  may  look  very  capricious,  but  there  is  method  to  our  madness.  Imagine  that 
individuals  are  told  one  of  two  possible  statistically  true  statements  "with  chance  p,  the 
state  is  high/low",  where  the  signal  quality  p  is  uniform  on  (0, 1).  After  being  told  that 
the  state  is  high  (resp.  low),  the  individuaFs  posterior  equals  p  (resp.  1  -  p),  since  his 
prior  was  flat.  Thus,  in  the  high  state,  the  density  of  individuals  with  posterior  behef 
p  is  f"{p)  =  p/[p  -I-  (1  —  p)]  =  p,  yielding  F"{p)  =  p^.  Similarly,  in  the  low  state, 
F^{p)  =  2p  —  p^,  for  all  p  e  (0, 1).  This  story  of  binary  signals  (i.e.  being  told  'high'  or 
'low')  with  variable  certitude  is  less  general  than  an  arbitrary  state-dependent  signal  with 
support  E,  since  it  implies  the  symmetry  property  that  we  exploit  below. 

We  see  that  in  the  terminology  of  SS,  the  beliefs  are  unbounded,  i.e.  the  support 
of  F"  and  F^  is  all  of  [0, 1].  The  intent  with  this  example  is  to  show  that  the  complete 
learning  result  from  SS  carries  over  to  this  framework,  even  though  the  'publicly'  observable 
information  is  not  refined  over  time. 

2.2  The  Dynamic  Behavior  of  Posterior  Beliefs 

We  assume  WLOG  that  action  1  (eg.,  Invest)  has  payoff  1  in  state  H,  and  payoff  —1 
in  state  L,  while  action  2  is  a  neutral  decision  (eg..  Don't  Invest)  having  payoff"  0  in  each 
of  the  two  states.  Before  deciding  upon  an  action,  the  individual  combines  his  sample 
information  with  his  private  signal,  and  arrives  at  his  private  posterior  belief  r  that  the 
state  is  H.  The  expected  value  of  action  1  is  thus  r-(l-r)  =  2r-l,  and  so  the  individual 


optimally  picks  action  1  exactly  when  r  >  1/2. 

There  is  really  an  inductive  aspect  to  the  derivation  of  the  posterior  beliefs.  We  could 
first  define  the  first  individual's  decision  rule  as  a  function  of  his  private  belief,  and  then 
describe  how  the  second  individual  bases  his  decision  on  his  private  belief  and  on  the 
observation  of  the  first  individual's  action,  and  so  on.  Figures  1  and  2  illustrate  the 
calculated  values  of  the  first  few  probabilities  and  thresholds  along  this  line.  But  instead 
we  shall  go  right  in  and  simply  say  that  individual  n  knows  the  decision  rules  of  all  the 
previous  individuals,  and  bases  his  own  decision  on  that  knowledge. 


Figure  1:  Private  Belief  Thresholds.  This  portrays  all  possible  values  of  the  threshold 
Pn{k)  for  the  first  four  individuals.  Here,  "Invest"  is  action  1,  and  "Don't"  is  action  2. 
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The  sample  information  privy  to  individual  n  can  be  conveniently  summarized  by  the 
number  of  predecessors  who  took  action  1.  More  formally,  we  shall  refer  to  hn{k)  as  the 
history  (observed  solely  by  individual  n)  that  k  of  the  first  n  —  1  agents  took  action  1  (i.e. 
invested).  Individual  n  can  then  calculate  the  various  possible  permutations  of  all  possible 
action  sequence.  As  individual  n  knows  how  his  predecessors'  decision  rule  as  a  function 
of  their  private  signals  and  the  history  they  could  observe,  he  is  able  to  calculate  the 


probability  of  each  of  their  actions  in  either  of  the  two  states,  under  each  of  the  possible 
action  sequence  permutations.  This  enables  him  to  calculate  the  ex  ante  chance  of  any 
observed  history  hn{k)  in  either  of  the  two  states:  we  shall  denote  these  probabilities  by 
~"(k)  and  Tr^ik)  (see  figure  2).  Finally,  individual  n  is  able  to  form  a  likelihood  ratio  that 
the  state  is  low,  namely  in{k)  =  7r^^{k)/7ij' (k).  When  k  is  not  specified,  we  shall  refer  to 
this  likelihood  ratio  as  ^„. 

Figure  2:  Probabilistic  Evolution.  This  illustrates  (at  the  tips  of  the  arrows)  the 
probabilities  i^Hik)  for  all  initial  histories  up  to  period  3.  Over  the  arrows  are  the  transition 
probabilities  in  state  H,  i.e.  1  —  F"{p)  of  "Invest"  and  F^^(p)  of  '"Don't",  by  equation  (2). 
Below  the  arrows  are  the  corresponding  transition  probabilities  in  state  L.  respectively 
F^ip)  and  I-  F^(p). 
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The  first  individual  only  observes  the  null  history,  and  so  we  shall  normalize  ^i(O)  =  1; 
later  on,  this  will  start  a  simple  inductive  derivation  of  the  likelihood  ratios.  As  private 
signals,  and  thereby  individual  actions,  are  random,  (^„)^i  is  a  stochastic  process.  Using 
the  likelihood  ratio  and  the  private  belief,  individual  n  may  form  his  posterior  belief  using 


Bayes'  rule.  In  state  H ,  this  is  given  by 

pnl^{k)  +  {l-p)n^{k)       p+(l-p)4(A;)  ^> 

Observe  that  this  expression  is  strictly  increasing  in  p.  We  may  interpret  (1)  as  simply 
saying  that  the  posterior  odds  (1  —  r)/r  of  state  L  equal  the  likelihood  ratio  times  the 
private  odds.  We  see  that  r  >  1/2  exactly  when 

We  shall  call  pn  the  public  beliefs  in  period  n,  since  this  would  be  individual  n's  posterior 
belief  (that  the  state  is  low)  had  he  a  purely  neutral  private  belief.^  Individual  n's  deci- 
sion rule  is  obvious:  He  takes  action  1  exactly  when  his  private  belief  p^  exceeds  p„,  or 
equivalently  when  his  posterior  belief  r^  exceeds  1/2. 

We  are  interested  in  the  asymptotic  behavior  of  beliefs  as  expressed  by  the  sequence 
of  likelihood  ratios  (£„),  or  public  beliefs  (pn)-  While  in  SS  the  likelihood  ratios  con- 
stitute a  martingale,  this  is  not  true  here.  Later  on,  we  explicitly  reject  this  and  other 
reasonable  candidates  for  martingales.  We  shall,  however,  exploit  the  fact  that  (p„)  is  a 
supermartingale  conditional  on  the  state. 

Particular  to  the  information  structure  of  this  example,  but  not  a  general  property 
of  the  observational  learning  paradigm,  is  a  key  symmetry  property:  F^(l  —  p)  =  1  — 
F^'{p).  Even  though  the  closed  form  solutions  will  repeatedly  exploit  this  fact,  we  are 
confident  that  it  is  not  the  driving  force  behind  the  complete  learning.  An  implication 
of  the  symmetric  posterior  distributions  are  symmetric  private  belief  thresholds,  and  a 
symmetric  probabilistic  evolution. 

Lemma  1  (Symmietry)  For  all  n  and  k  =  0, . . .  ,n  —  l,  we  havepn{k)  =  1— pn(n  — 1  —k) 
and  n"{k)  =  7r,^'(n  -  I-  k). 

Proof:  We  proceed  by  induction  on  n.  First  rrf  (0)  —  7rf  (0)  =  1  and  pi(0)  =  1/2,  and  so 
both  formulae  initially  obtain.  Let  N  >  I,  and  assume  that  both  equations  hold  for  all 
ri  <  N.  Let  us  now  establish  the  equations  for  period  N. 

Fix  any  sequence  of  choices  by  the  first  TV  —  1  individuals.  Individual  N  knows  exactly 
how  many  of  his  predecessors  invested,  and  considers  all  the  possible  orders  of  moves  that 
could  lead  to  this  outcome.  For  a  given  such  order,  let  k{n)  denote  how  many  of  the 
first  n  <  N  -  1  individuals  have  invested,  and  pn{k{n))  the  private  belief  threshold  for 
individual  n  <  N  —  1.  Denote  by  A  the  set  of  individuals  who  took  action  1  in  this 
particular  ordering  and  by  B  the  set  [1,. .  .,N  -  1}  \  ^.  Then  the  inductive  hypothesis 
and  F^(l  -  p)  =  1  -  F^{p)  yield 

Y[[l-F"{Pn{k{n)))]  n  F"{pm{k{m))) 

neA  meB 


^This  label,  taken  from  SS,  is  really  rather  inappropriate  here,  since  only  individual  7i  ever  sees  the 
public  belief  Pn. 


=     n  ^''(1  -  Pnik{n)))  n  [1  -  ^'(1  -  Pm{k{m)))] 

neA  meB 

=     n  F'^iPnin  -  1  -  k{n)))  [J  [1  -  F'^iPmim  -  1  -  k{w,)))] 

When  individual  N  sums  the  above  probabilities  corresponding  to  all  the  possible  sequences 
that  could  lead  to  the  outcome  at  hand,  he  will  clearly  discover  that  rr^-{k)  —  7r^{N  —  l-k). 
We  can  thus  conclude 

Application  of  (2)  then  yields  pn{k)  =  I  -  pn{n  —  I  —  k).  0 

Lemmas  1  and  2  easily  imply  the  relations 

^"^   '  n^ik)  +  TTJ^ik)       n^in  -  1  -  k)  +  n,^{k)  ^•'> 

We  now  turn  to  the  key  difference  equation  that  describes  the  evolution  of  the  system 
over  time.  When  individual  n  is  told  that  k  predecessors  took  action  1,  he  may  conclude 
that  one  of  two  scenarios  confronted  his  immediate  predecessor:  Either,  individual  n  —  1 
was  told  that  k  —  1  predecessors  had  taken  action  1,  and  he  then  took  action  2,  or  individual 
n  —  1  was  told  that  A;  predecessors  had  taken  action  1,  and  he  then  took  action  2.  This 
leads  to.  for  1  <  fc  <  n  —  2, 

7r„^(/c)  =  7r„^_i(A:  -  1)  [l  -  F" {Pn-i{k  -  1))]  +  <_aA;)F^(p-„_i(A;))  (5) 

One  could  assert  the  corresponding  relationship  in  state  L,  but  by  Lemma  1  that  is  already 
embodied  in  (5).  The  relation  also  vacuously  holds  when  A;  =  0  or  A;  =  n  —  1,  for  then 
one  of  the  terms  in  the  sum  vanishes.  It  is  equation  (5)  that  drives  the  dynamics  of  the 
model.  In  principle,  one  can  always  use  it  to  calculate  all  probabilities  (and  likelihoods 
and  thresholds)  for  the  model. 

2.3  Closed  Form  Solutions 

We  now  summarize  a  slough  of  calculations,  whose  derivations  are  relegated  to  the 
Appendix  (which  non-skeptical  readers  should  feel  free  to  skip).  The  result  describes  the 
probabilistic  evolution  of  the  model  in  state  H,  and  the  public  beliefs  that  arise  after  every 
historv. 


Lemma  2    For  all  n  >  1,  k  =  0, . . .  ,n  —  1  we  have 

2{n-k)  -  1 


(6) 

(7) 


One  can  verify  that  the  numbers  in  figures  1  and  2  agree  these  formulae. 

One  nice  property  of  the  example  is  that  we  can  explicitly  the  probability  that  everyone 
chooses  the  correct  action  from  the  very  outset.  For  the  chance  that  the  first  n  individuals 
take  the  correct  action  1  in  state  H  is 


(2n  +  l)2 

/2r7\ 

(2n  +  l) 

\    (2n.)!    1 
_22'i(n!)2 

2 

(271  +  1) 

■(2n-l)(2n-3)---3-l' 

2n(2ri 

_  2) ...  4 • 2 

[(2n  +  l)(2n  -  1)]  [(2(n  -  1)  +  l)(2(n  -  1)  -  1)] .  • .  [5  .  .3]  [3  •  1] 


(2n  •  2n)(2(n  -  1)  ■  2(n  -  1))  •  •  •  (4  •  4)(2  •  2) 


=  n 


4r 


j=i 


4f 


By  Wallis'  formula,  the  limit  of  this  expression  is 

if  -1^2 

TT 


n 


4r 


where  tt  =  3.14159 ...  is  the  well-known  constant.  That  is,  the  chance  that  all  individuals 
invest  m  state  H  is  'I/tt!  This  result  stands  in  contrast  to  the  fact  that  there  is  zero  chance 
that  all  individuals  will  take  the  wrong  action  in  state  H. 

3.  CONVERGENCE 


Unfortunately,  the  discrete  distribution  found  in  Lemma  2  is  unknown  (to  us!)  despite 
its  simple  form  and  similarity  to  the  hypergeometric  distribution.  But  we  can  calculate 
its  mean  and  prove  that  it  converges  to  0.  This  is  the  desired  limit,  for  a  zero  threshold 
implies  that  all  individuals  invest,  and  that  is  indeed  the  correct  action  in  state  H. 

Lemma  3  (Convergence  in  Mean)     In  state  H ,  the  stochastic  process  (pn)  converges 
to  0  in  mean. 

Proof:     For  any  n  >  2,  the  mean  of  Pn  satisfies "* 
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n  —  1  —  k    . 
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■* Below,  we  exploit  the  combinatorial  identity  4"  =  T,'k=o  Ck)C^n-i-k^)^  ^°^  ^^^^^  *'^^"^  several 
individuals  on  the  internet  newsgroup  sci.math. 


It  is  not  hard  to  prove  that  this  latter  expression  converges  to  0.  One  way  of  doing  so  is  as 
follows:  As  {Epn)  is  a  decreasing  sequence,  and  bounded  below  by  0.  it  converges  to  some 
nonnegative  limit.  If  its  limit  were  strictly  positive,  then  we  could  take  its  logarithm: 
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Hence.  p„  ^  0  in  mean. 

By  equations  (1)  and  (2),  we  have 


0 


r„  - 


P(l  -Pn] 


p(l  -Pn)+pn{l  -P) 


Thus,  by  Lemma  3,  the  posterior  belief  sequence  (r^)  converges  to  0  in  probability  by 
Chebyshev's  inequality.  Still,  we  cannot  yet  conclude  almost  sure  convergence  of  (r^).^ 
For  this,  we  think  that  we  must  use  the  Martingale  Convergence  Theorem.  Figure  3 
provides  a  flavor  of  the  nature  of  the  convergence  here. 


Figure  3:  Empirical  Density  Function. 
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^To  recall  why  this  is  so,  consider  the  following  well-known  example  (which  appears  for  instance  as 
Example  7.4  in  Bartle  (1966)).  Let  (A',i)^j  be  a  sequence  of  stochastic  variables  on  the  probability  space 
[0,1]  with  Lebesgue  measure.  To  any  given  n  >  1  choose  a{n)  such  that  2"'"'  <  n  <  2"'"'"''^.  Let  A'„ 
be  the  indicator  function  of  the  subinterval  [{n  -  2°(")}2-°("',  {n  +  1  -  2''("'}2-"("']  C  [0, 1].  Clearly, 
0  <  An  <  1,  and  EXn  =  2""'"'  ->  0;  thus,  .Y„  ->  0  in  mean  and  probabihty.  But,  there  is  no  w  €  [0, 1] 
for  which  A'„(u;)  -¥  0,  because  the  \'alue  1  will  pop  up  infinitely  often. 


Consider  the  set  of  all  possible  infinite  histories,  and  treat  a  finite  history  as  the  union  of 
all  infinite  histories  that  contain  it.  Endow  the  set  with  the  cr-algebra  A  that  is  generated 
by  all  possible  finite  length  histories.  All  infinite  length  histories  must  be  in  A  as  well, 
because  an  infinite  history  is  the  countable  intersection  of  finite  histories.  The  probability 
measure  on  A  is  that  of  p„,  so  that  history  /in(A;)  has  probability  -K^ik). 

Now  consider  the  sequence  of  sub-a-algebras  (>ln)^^,  where  An  is  generated  by  all 
finite  histories  of  length  n.  Note  that  {A-n)'^-x  is  an  increasing  sequence  of  a-algebras 
because  any  history  of  length  n  -  1  is  the  union  of  the  two  histories  of  length  n  that  can 
follow  it.  Also,  pn  is  clearly  measurable  w.r.t.  An-  The  justification  for  introducing  (^„)  is 
that  we  can  show  that  the  process  {pn^An)  is  a  supermartingale,  which  in  turn  establishes 
the  almost  sure  convergence. 

Lemma  4  (Conditional  Supermartingale)  (pm^n)  '^■^  cl  supermartingale  m  sta.te  H. 
Proof:     We  start  by  verifying  an  equation  which  mysteriously  holds  in  this  example. 

Pnik)      =      {l/2)[F"{Pn{k))  +  F'{pn{k))]Pn^dk) 

+  {l/-2)[{l-F"{Pn{k)))  +  {l-F'{Pn{k)))]PnMk+l)  ^    ' 

for  all  n  and  k  =  0, . . . ,  n  —  1.  We  do  not  claim  that  (8)  holds  beyond  the  current 
specification,  as  we  in  fact  can  show  by  counterexample  that  it  does  not.  But  its  validity 
here  is  easily  verified  it  by  appealing  to  the  closed  form  solutions  of  Lemma  2.  For  instance, 
rewrite  (8)  as 

Pn{k)  -  Pn+xik  +  1)  =  (1/2)  [F^'iPnik))  +  F^{Pn{k))]  (p„+i(A;)  -  PnMk  +  1)) 

But  by  (6)  and  F"{p)  +  F^ip)  =  p^  +  (2p  -  p^)  =  2p,  this  is  equivalent  to 

/I  1      \       2(n- A;)  -  1       2 

which  certainly  holds. 

We  now  prove  that  E\pn+i\H,pn  —  Pn{k)]  <  pn{k),  which  is  the  supermartingale 
inequality  for  .state  H.  Just  rewrite  (8)  as 

Pn{k)      =      \[F"{Pn{k))Pn+l{k)+{l-F''{Pn{k)))Pn^l{k  +  l) 
+      ^  [F'^{Pn{k))Pn^,{k)  +  (1  -  F^Pnik)))  P„+,(fc  +  1)^ 

From  the  closed  form  solutions  we  know  that  Pn+\{k)  >  pn+iik  +  1).  We  also  know  that 
F^  is  always  larger  than  F" .  Therefore,  the  latter  of  the  two  summands  is  larger.  Since 
the  two  terms  average  out  to  give  pn{k),  the  first  term  must  be  smaller  than  pn{k);  ie. 

F"{pn{k))Pn^yik)  +  (1  -  F^(p„(fc)))p„+i  (fc  +  1)  <  Pn{k) 

This  is  exactly  the  statement  that  the  supermartingale  inequality  holds  in  state  H.        0 
Finally,  we  arrive  at  our  stated  goal. 


Proposition  1  (Complete  LeEirning)  Almost  surely,  £„  — ^  0  in  state  H ,  and  1/^^  — >■  0 
in  state  L. 

Proof:  By  Lemma  4,  (p^)  is  a  supermartingale  in  state  H.  The  Martingale  Convergence 
Theorem  therefore  assures  us  that  it  converges  a.s.  to  a  r.v.  p^c-  As  (p„)  is  bounded,  non- 
negative,  and  converges  in  mean  to  0,  we  know  that  a.s.  Poo  =  0.  So  (4)  implies  £„  — )■  0.  0 
It  is  worth  remarking  that  complete  learning  here  does  not  imply  that  eventually  all  in- 
dividuals must  take  the  correct  action;  or,  in  the  terminology  of  SS,  we  need  not  eventually 
enter  a  'correct  herd'.  The  basic  reason  is  the  failure  of  what  we  called  the  'overturning 
principle'.  Suppose  that  one  million  individuals  have  taken  action  1,  and  then  the  next 
individual  takes  action  2.  That  will  hardly  affect  the  likelihood  ratio.  Indeed,  when  indi- 
vidual one  million  and  two  observes  his  summary  statistic  of  history  (that  all  but  one  took 
action  1).  he  will  find  most  likely  the  hypothesis  that  the  individual  who  took  the  contrary 
action  was  one  of  the  early  ones;  therefore,  the  unexpected  action  which  bespeaks  a  signal 
more  powerful  than  the  entire  weight  of  history,  will  have  a  negligible  effect,  simply  be- 
cause the  information  is  not  refined.  In  SS,  we  explain  why  we  cannot  rule  out  an  infinite 
string  of  such  isolated  contrary  actions  —  even  whilst  the  likelihood  ratio  converges  to  0. 

4.   OTHER  MARTINGALES? 

FinalW,  and  partially  in  response  to  the  stated  disbelief  of  some,  we  wish  to  show  that  all 
other  obvious  (and  nontrivial!)  candidates  for  (sub-)martingales  in  the  model  fail  the  test. 
In  particular,  we  shall  consider  the  public  beliefs  and  the  likelihood  ratios.  Note  that  the 
private  beliefs  (p„)  clearly  cannot  be  a  conditional  (sub-)martingale  because  they  are  i.i.d., 
and  cannot  be  an  unconditional  martingale  because  obviously  ||9£'[pn+i  |p„]/(5pn)||  <  1. 
Similarly,  the  realized  posterior  beliefs  (f^)  could  not  possibly  be  a  (sub-)martingale. 
Finally,  it  is  easy  to  show  that  the  likelihoods  (tt^)  is  not  a  (sub-)martingale. 

The  analysis  below  uses  the  calculated  thresholds  and  probabilities  from  figures  1  and  2. 
1.  Public  Beliefs. 

Let  us  start  with  the  sequence  of  public  beliefs,  namely  (p„).  In  a  single  person  experimen- 
tation or  rational  expectations  framework  (see,  for  instance.  Aghion,  Bolton,  Harris  and 
.lullien  (1991)  and  Bray  and  Kreps  (1987)),  as  well  as  the  standard  observational  learning 
paradigm  analyzed  in  SS  and  the  herding  literature,  this  stochastic  process  is  a  martingale 
unconditional  on  the  state  of  the  world.  But  figures  1  and  2  and  Lemma  2  imply  that 

Pi(l)    =    1/4  <  84/384  =  (54/64)(l/6)  +  (10/64)(l/2) 

=    [(3/4)(15/16)  +  (l/4)(9/16)](l/6)  +  [(3/4)(l/16)  +  (l/4)(7/16)](l/2) 
=    {[1  -  pi(l)][l  -  F^(pi)]  +pi(l)[l  -  F^(p0]}p2(2) 

+{[1  -pi(l)]F'^(p:)  +p-i(l)F^(pi)}P2(l) 
=     Pr(#2  invests  |  #1  invested)  p2(2)  +  Pr(#2  doesn't  |  #1  invested)  p2(l) 
=    F[p2  I  A:(l)  =  1] 

Hence,  the  public  beliefs  are  not  an  unconditional  martingale.  If  one  considers  the  history 
where  #1  has  moved  and  not  invested,  one  can  show  that  public  beliefs  are  also  not  an 
unconditional  supermartingale. 

10 


2.  Likelihood  Ratios. 

Next  consider  the  likelihood  ratio,  which  in  SS  is  a  martingale  conditional  on  state  H.  We 
shall  start  at  the  very  same  history  as  above,  where  £  has  the  value  1/3.  In  state  H  there 
is  a  probability  of  15/16  that  its  next  value  is  1/5,  and  a  probability  of  1/16  that  its  next 
value  is  1.  In  expectation  we  get  1/4  <  1/3.  Similarly,  by  starting  with  #1  investing,  we 
can  show  that  it  is  not  a  supermartingale.  Finally,  unconditional  on  the  state,  it  is  easy 
to  rule  out  a  (sub-)martingale. 

A.  OMITTED  PROOF  OF  LEMMA  2 

Lemma  2  (Closed  Form  Solutions)     For  all  n  >  I,  fc  =  0, . . . ,  n  —  1  xue  have 

2{n-k)  -1 


Pn{k)    = 


2n 


^H(l.\  2n         1  ^4-4n 

""^^)     =     2(n-A:)-l2 

Proof:     Case  1:   k  =  Q  ok  k  =  n- I. 

This  is  particularly  simple  because  everyone  can  exactly  deduce  what  all  predecessors  knew 
and  did.  Consider  first  A;  =  0.  We  shall  proceed  by  induction  on  n.  It  is  readily  seen  that 
the  formulas  are  correct  for  n  =  1:  namely,  Pi(0)  =  1/2  and  7rf^(0)  =  1.  Now  assume  that 
it  holds  for  arbitrary  n  >  1,  and  we  shall  prove  it  holds  for  n  +  1  (still  with  k  —  Q). 

Individual  n  +  1  is  thus  told  that  none  of  the  first  n  individuals  invested;  therefore,  he 
can  then  conclude  that  individual  n  had  been  told  that  none  of  the  first  n  —  1  individuals 
invested.  So  p„  =  (2n  —  l)/(2n),  and  by  (4),  £„  =  2n  —  1.  Individual  n  took  action  2. 
From  the  viewpoint  of  individual  n  +  1,  there  was  in  state  H  a  probability  of  F"{pn)  that 
this  could  happen,  and  in  state  L  a  probability  of  F^{pn)-  Thus  the  next  likelihood  ratio 
is 

in+l      =      LF''{Pn)/F"{Pn) 

/2(2n-l)       (2n-l)2\   /(2n-l)2 

—      t-n  1 


2n  (2n)2     ;  /      (2n)2 

,J2n-l)(2n  +  l)       An^ 

which  by  (2)  yields  pn+i  =  [2(n  +  1)  -  l]/2(n  +  1),  as  desired.  Also,  equation  (5)  and  the 
inductive  hypothesis  yields 

d(0)    =    7rf(0)F^(p-„(0)) 

^    o4-4n/2(n-l)V(2n-l)^ 


2 


\   n-l    J       (2n) 

,4,4„       (2n-2)!  (2n-2)!        (2n  -  1)^  ^  ^-4" 

(n-l)!(n-l)!(n-l)!(n-l)!     (2n)2 


(2n)!^- 
nln\ 


For  k  =  n  —  I  just  apply  Lemma  1. 
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Case  2:  l<  k  <n-2. 

Again  we  shall  use  induction  on  n.   The  formula  is  true  for  7^  =  2,  the  first  nonvacuous 
case,  as  P3(l)  =  1/2  and  Trf  (0)  =  5/32  follows  from  figures  1  and  2.  Next  we  shall  assume 
that  the  formulae  hold  for  7i,  and  establish  them  for  n  +  1. 
We  start  by  the  induction  for  ir" .  First  rewrite  (5)  as 

d(^-)     =    7r^{k-l)[l-F"{p,,{k-l))]    +  7r:!{k)F"{Pr.ik)) 


-^"{k-l) 


'■2{n-k)  +  V 
2n 


+  ^'^{k) 


'2{n-k)-V 
2n 


-4n- 1-4^2  +  8^^  +  4^  (2{n-k)-lf    „,,, 

-K^{k  -  1)  +  ,„; -^nik) 


An' 


An? 


teTm\  term2 

Now,  using  the  hypothesized  formula  for  ir^^ik  —  1),  we  can  rewrite  term  1  as 

(477-2^-  +  l)(2A:-l)_„,,^ 

72 ^n  [k  -  1) 


An' 

(An  -2k  +  l){2k  -  l)(2n  -  1)  f2k  -  2\  /2(n  -  k)' 
4n2(2(n-fc)  +  l)24"-4        V  A;  -  1  j  V  n-k   ^ 

{An  -2k  +  l)k{2n  -  1)  f2k\  (2{n  -  k)\  (2n  -  2\ 


'2n  -  2' 
.  n-  I  J 


4n2(2(n  -  k)  +  l)24"-3  \k  J\   n-k   j 

{An  -2k  +  l)k     (2k\  /2(n  -  k)\  /2n> 
n(2(n  -  fc)  +  1)24"  \  A;  j  V   „  _  j^   )\nj 


n-  1 


{An-  2k  +  l)k 


(2(n  +  l)-l) 


n(2n  +  l)       (2(n  +  l-A;) -l)24("+i)-4 

and  term  2  like  this: 

(2(n-A:)-l)2(2n-l)  /2k\  (2{n  -l-k)\l 
4n2(2(n  -  k)  -  l)2''"-4  \kj\  n-l-k  )\ 

n-k  (2k\  /2(n  -  k)\  (2n\ 
n24"  \k)\  n-k   )\n] 

{n  -  k){2{n  -  k)  ^  \) 


'2k^ 


%n  +  1  -  1  -  it)' 
,    n  +  1  -  1  -  A:    , 


'2n^ 

,  n  ; 


2n-2^ 
n  —  I  J 


(2(n  +  l)-l) 


_  f2k\  /2(n  +  1  -  1  -  fc)\  j2n' 
(2(n  +  1  -  A;)  -  l)24("+i)^  (^  A:  j  (^  n  +  l-l-k   )[n^ 


n(2n+l) 

Finally,  adding  terms  1  and  2  provides  the  desired  expression  for  n"j^i{k)  because 

(4n-2A:  +  l)A:       (n  -  A:)(2(n  -  A:)  +  1)  _ 


7i{2n  +  1) 


n{2n  +  1) 
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We  now  need  only  manipulate  the  formula  for  ^^"_^_^  and  (3)  to  establish 

7r„^^i  in  +  l-k)  _  irr  _  2(n  +  I  -  k)  -  1 


Pn+l{k) 


TT^^An  +  I  -  k)  +  n^^.ik)        |^  +  ^^^a±^  2(n  +  l) 

0 
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